Information filtering via biased heat conduction 
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Heat conduction process has recently found its application in personalized recommendation [T. Zhou et at, 
PNAS 107, 45 1 1 (2010)], which is of high diversity but low accuracy. By decreasing the temperatures of small- 
degree objects, we present an improved algorithm, called biased heat conduction (BHC), which could simulta- 
neously enhance the accuracy and diversity. Extensive experimental analyses demonstrate that the accuracy on 
MovieLens, Netflix and Delicious datasets could be improved by 43.5%, 55.4% and 19.2% compared with the 
standard heat conduction algorithm, and the diversity is also increased or approximately unchanged. Further 
statistical analyses suggest that the present algorithm could simultaneously identify users' mainstream and spe- 
cial tastes, resulting in better performance than the standard heat conduction algorithm. This work provides a 
creditable way for highly efficient information filtering. 



PACS numbers: 89.20.Hh, 89.75.Hc, 05.70.Ln 

With the advent of the Internet [Q]] and wide application 
of Web 2.0 techniques, there sprout many web sites that en- 
able large communities to aggregate and interact. For exam- 
ple, Twitter allows its 1.7 x 10 8 members to share interests 
and life experiences, Facebook has already exceeded 500 mil- 
lion members since July 16th, 2010, and their members are 
growing ever faster. This brings massive amount of accessible 
information, more than every individual's ability to process. 
Searching, filtering and recommending thus become indis- 
pensable in the Internet era, in which the personalized recom- 
mender systems have become an effective tool to address the 
information overload problem by predicting users' interests 
and habits based on their historical records. Personalized rec- 
ommender systems have been used to recommend books and 
CDs at Amazon.com, movies at Netflix.com, and news at Ver- 
sifi Technologies (formerly AdaptiveInfo.com) 01 . Motivated 
by the practical significance to e-commerce, recommender 
systems have caught increasing attention and become an es- 
sential issue JH lift. A personalized recommender system in- 
cludes three parts: data collection, model analysis and recom- 
mender algorithm, where the algorithm is the core part. Thus 
far, various kinds of algorithms have been proposed, including 
collaborative filtering (CF) approaches O-UOll. c ontent-based 
analyses flllfl^il . ta g-aw are algorithms 1 1 3l4l 5ll . link predic- 
tion approaches Il6l4l8ll . hybrid algorithms Il9 i |20I | . and so 
on. For a review of current progress, see Refs. |2|,|2j]] and the 
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references therein. 




FIG. 1 : (Color online) Illustration of heat conduction algorithm on a 
bipartite user-object network: (a) The objects collected by the target 
user are activated, with temperature 1, while others are of temper- 
ature 0. (b) Each user's temperature is the average over all her/his 
collected objects, (c) Same process happens from users to objects. 

A recommender system could be described by a bipartite 
network ll22[|23ll . in which there are two kinds of nodes: users 
U and objects O. The users' historical records are represented 
by the edges connecting users and objects. Supposing there 
are m objects O = {01,02, • • ■ , o m } and n users U = {1*1,1/2, 
• • ■ , u n }, the system can be fully described by an adjacency 
matrix A = {a/ Q } mi „, where a; Q = 1 if o a is collected 
by ui, and a; Q = otherwise. A reasonable assumption is 
that objects collected by users are what these users like and a 
recommendation algorithm aims at predicting users' pers onal 
opinions on the objects they have not yet collected I24H26I1 . 
In the standard heat conduction (HC) algorithm, we first con- 
struct a propagator matrix W , where the element w a p de- 
notes the conduction rate from object op to o a . Denote H as 
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the temperature vector of m components: the source compo- 
nents are of temperature 1, while the remaining components 
are of temperature 0. Then the temperatures associated with 
the remaining nodes could be calculated by solving the ther- 
mal equilibrium equation W h H = f 12611 . where f is the flux 
vector. This is the discrete analog of -KV 2 T(r) = V • J{r), 
where k is the thermal conductivity, V 2 T(r) is the tempera- 
ture gradient and V • J(r) is the local heat flux. In this pa- 
per, H(i) plays the role of —nT(r) and f (i) plays the role of 
V • J{r) $Z($\ . In the standard HC algorithm, the temperature 
of the collected objects is constant, and the heat will diffuse 
from objects to users, and then from users to objects. The 
temperatures of the uncollected objects are then considered 
as recommendation scores: the objects given higher tempera- 
tures would be recommended preferentially (see Fig.l for an 
illustration). Since HC algorithm B26I1 is implemented based 
on matrix operations, it is very time-consuming and cannot be 
applied to large-scale systems. Zhou et a/.|01 proposed a lo- 
cal HC algorithm, which spreads the heat on the user-object 
bipartite network and can quickly generate highly diverse yet 
less accurate recommendations. As a benchmark for compar- 
ison, we call it standard HC algorithm (hereinafter, HC only 
stands for local heat conduction algorithm |0]). 

In this Brief Report, we present the biased heat conduc- 
tion (BHC) algorithm to see how objects' degrees affect the 
algorithmic performance. Using data from three real sys- 
tems (MovieLens, Netflix and Delicious), we show that giving 
higher temperatures to the large-degree objects than the stan- 
dard HC algorithm could generate highly accurate and diverse 
recommendations . 

To test the performance of a recommendation algorithm, we 
randomly divide a given data set into two parts: the training 
set and the probe set. The information contained in the probe 
set is not allowed to be used for recommendation, namely we 
provide a recommendation list for each user only based on 
the training set. In this Brief Report, we always keep 90% of 
links in the training set and 10% of links in the probe set, and 
employ three different metrics to measure accuracy, novelty 
and diversity of recommendations. 

Accuracy [25]. A good recommender algorithm should 
rank preferable objects that match the user tastes in higher po- 
sitions, i.e., the objects in the probe set (indeed being collected 
by users) should be put in high positions of the recommenda- 
tion list. For a user Ui, if the entry Ui-Oj is in the probe set, 
we measure the position of Oj in the ordered list for ui. For 
example, if there are 100 uncollected objects for Ui and Oj is 
the 3rd one from the top, we say the position of Oj is 3/100, 
denoted by ry = 0.03. A good algorithm is expected to give 
small Tij . Therefore, the mean value of the position (r) over 
all entries in the probe set can be used to evaluate the algorith- 
mic accuracy: the smaller the average ranking score 12511 . the 
higher the algorithmic accuracy. 

Novelty and diversity B27I1 . Since there are countless chan- 
nels to obtain popular objects' information, uncovering very 
specific preference, corresponding to unpopular ones, is much 
more significant than simply picking out what a user likes 
from the list of the best sellers |4j]. To measure this fac- 
tor, we go simultaneously in two directions: novelty (mea- 



TABLE I: Basic statistics of the tested data sets. 



Data Sets 


Users Objects 


Links 


Sparsity 


MovieLens 


1,574 943 


82,520 


5.56 x 10" 2 


Netflix 


10,000 6,000 


701,947 


1.17 x 10~ 2 


Delicious 


10,000 232,657 


1,233,997 5.30 x 10" 4 



TABLE II: Algorithmic performance for MovieLens, Netflix and De- 
licious data sets on the standard HC algorithm |4|]. The popularity 
(fc) and diversity S are obtained at L = 10. 



Data Sets (r) (fc) S 

MovieLens 0.15156 3.085 0.88196 

Netflix 0.10629 1.344 0.86296 

Delicious 0.26129 1.915 0.98066 



sured by popularity) and diversity (measured by Hamming 
distance). The popularity is defined as average degree of all 
recommended objects, (fc). Since it's hard for the users to 
find the unpopular objects, a good algorithm should prefer to 
recommend small average objects. In addition, the personal- 
ized recommendation algorithm should present different rec- 
ommendation lists to different users according to their tastes 
and habits. The diversity is quantified by the Hamming dis- 
tance S = (Hij), where = 1 — Qij(L)/L, with L is 
the length of recommendation list and Qij(L) is the number 
of overlapped objects in u$'s and Uj's recommendation lists. 
The larger S corresponds to higher diversity. 

Three benchmark datasets, named MovieLens, Netflix and 
Delicious (See Table 1 for basic statistics), are used to test the 
present algorithm. The Netflix data set is a randomly sample 
of huge dataset provided for the Netflix Prize ll30ll . and the 
Delicious data set is obtained by downloading publicly avail- 
able data from the social bookmarking web site Delicious.com 
(taking care to anonymize user identity in the process). The 
Delicious data is inherently unary while both MovieLens and 
Netflix data sets contain explicit ratings from one to five. We 
apply a coarse-graining method to transform them into unary 
forms: an object is considered to be collected by a user only if 
the given rating is larger than 2. The sparsity of the data sets 
is defined as the number of links divided by the total number 
of user-object pairs. 

Applying the standard HC algorithm on MovieLens, Net- 
flix and Delicious data sets, (r), (fc) and S are shown in Table 
II. One can find that although the accuracy of the standard 
HC algorithm is poor, it provides highly diverse recommen- 
dations. We argue that the less accuracy of the standard HC 
algorithm lies in the fact that it assigns overwhelming priority 
to the small-degree objects, leading to strong bias. Therefore, 
the standard HC algorithm could be improved by reinforcing 
the influence of the large-degree objects. In the last step of 
the standard HC algorithm, all of the heat an object has re- 
ceived is divided by its degree. Although the large-degree ob- 
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FIG. 2: (Color online) Performance of the BHC algorithm on Movie- 
Lens, Netflix and Delicious data sets. The plots (a)-(c) show av- 
erage ranking score (r) vs. A. Subject to (r), the optimal A op t are 
0.84, 0.85 and 0.50, and the corresponding (r) opt are 0.0852, 0.0474, 
0.2112. The plots (d)-(f) display the results for (k) and (g)-(i) for S 
with L = 10. All the data points are averaged over ten independent 
runs with different divisions of training-probe sets. 

jects could receive lots of heat, their temperatures are very 
low, while small-degree objects would obtain high tempera- 
tures and thus be put in the top positions of recommendation 
lists. A clear advantage of the standard HC algorithm is its 
ability to dig out the unmainstream tastes that almost can not 
be found by classical methods. However, users generally like 
popular objects and thus an algorithm should also give chance 
to them. We therefore propose the BHC algorithm taking into 
account the object degree effect in the last diffusion step. To 
an target object o a , instead of dividing by its degree k(o a ), the 
final temperature is obtained dividing by k A [p a ) . The element 
w a p of the matrix W h would be w af} = Ya=i TEf^f • 

Comparing with the standard HC algorithm (i.e., A = 1), the 
influences of large-degree objects would be strengthened if 
A < 1 or depressed if A > 1. 

A summary of the primary results for BHC algorithm is 
given in Table III. Figure |2](a-c) report the algorithmic ac- 
curacy (r) as a function of A, from which one can find that 
the curves obtained by BHC have clear minimums. For ex- 
ample, the optimal parameter of MovieLens data is around 
A op t = 0.84, strongly supporting our argument that the effects 
of large-degree objects should be increased. Compared with 
the standard case (i.e. A = 1), the average ranking score (r) 
is reduced from 0.1516 to 0.0852 (improved by 43.5%). This 
results indicate that giving more opportunities to the large- 
degree objects will greatly increase the algorithmic accuracy. 
More interestingly, when L = 10, the Hamming distance of 
MovieLens is also improved from 0.8820 to 0.9248 (see Fig. 
Oi)), which is even better than 0.9173 obtained by the hybird 
algorithm [4]. Actually, the standard HC algorithm prefers 
to give more opportunities to the small-degree objects and 
ranks them at the top positions of many users' recommenda- 
tion lists. Therefore, the Hamming distance may not be the 



FIG. 3: The plot (a) shows the object degree distribution of Net- 
flix data, and (b)-(d) show the correlations between the occurrence 
number n(k) and the object degree k of MD, standard HC and BHC 
algorithms when L — 10. The results of MovieLens and Delicious 
are similar. 



TABLE III: Algorithmic performance on BHC algorithm. The Ham- 
ming distance is corresponding to L = 10. 



Data Sets 


A op t 


{r opt ) Improvement 


5*opt 


MovieLens 


0.84 


0.0852 43.5% 


0.9248 


Netflix 


0.85 


0.0474 55.4% 


0.8200 


Delicious 


0.50 


0.2112 19.2% 


0.9795 



highest although the popularity is the lowest. Figure 2(b,e,h) 
show the similar results on Netflix, where the optimal param- 
eter is Aopt = 0.85. Results of MovieLens and Netflix are 
very close to each other, with the fact that both data sets are 
movie-related and the sparsity is close. The optimal parame- 
ter A op t on Delicious (See Fig.2(a,d,g)) equals 0.5, with very 
small (k) and very high S (« 0.98). Both the optimal ranking 
score (r)opt = 0.2112 and the Hamming distance S = 0.9795 
of Delicious are much larger than the ones of MovieLens and 
Netflix. The results are twofold: the higher sparsity of edges 
and the larger number of objects. The former leads to less 
accurate recommendation while the latter results in higher di- 
versity. 

Table IV reports the performances obtained by several al- 
gorithms on MovieLens dataset, from which one can find the 
accuracy (r) of BHC algorithm is close to the result of HO-CF 
algorithm which needs to compute the second-order similarity 
information, and the diversity of BHC algorithm is the high- 
est one. In order to explain the reasons why both accuracy and 
diversity can be enhanced by BHC algorithm, the frequencies 
of appearances n{k) of objects of degree k in all users' rec- 
ommendation lists are investigated. We show the results of 
a typical example, Netflix, where the length of recommen- 
dation list is L = 10. Different from the power-law degree 
distribution in Figj3ja), n(k) of BHC algorithm has butterfly 
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TABLE IV: Algorithmic performance for MovieLens data, (k) and 
S are corresponding to L — 10. MD is abbreviations of the algo- 
rithms proposed in Ref. (H, Heter-NBI, HO-CF, IMCF and WHC 
are abbreviations of algorithms with heterogeneous initial resource 
distribution proposed in Ref. 12711 . high-order collaborative filtering 
(CF) algorithm proposed in Ref. [28], improved modified CF algo- 
rithm in Ref. [29] and the algorithm presented in Ref. 12311 . 

Algorithms (r) S (k) 

MD 0.1060 0.617 233 

HC 0.1516 0.750 3.09 

Heter-NBI 0.1010 0.682 220 

HO-CF 0.0826 0.9127 237 

IMCF 0.0877 0.826 175 

WHC 0.0914 0.941 179 

BHC 0.0852 0.925 197 



shape, which means that the objects with large or small de- 
grees are recommended more frequently. Figure [3jb) shows 
that mass diffusion algorithm prefers to recommend the large- 
degree objects, while Fig. [3]c) shows that the standard HC 
algorithm gives higher recommendation scores to the small- 
degree objects, thus the popular objects are largely depreci- 
ated. Comparing Fig. [3jc) with Fig. |3d), at the optimal case 
A pt = 0.85, both small-degree and large-degree objects are 
recommended with high frequency by the BHC algorithm. In 
a word, the advantage of BHC is that it could not only dig out 
the users' very special tastes, but also find out the common 
interesting objects. 

In this Brief Report, we propose a biased heat conduction 
algorithm by considering the degree effects in the last step 
of the local heat conduction process [4], which could greatly 
improve the accuracy of the standard HC algorithm. In the 



standard HC algorithm, the small-degree objects are recom- 
mended overwhelmingly because in the last step, to calculate 
the temperature, the received heat is divided by the object de- 
gree. This division largely depresses the chance of a large- 
degree object to be recommended. In contrast, the power- 
law object degree distribution indicates that large-degree ob- 
jects are preferred by many users, therefore a good algorithm 
should also pay attention to the them. In addition, a per- 
sonalized recommender system should provide each user rec- 
ommendations according to his/her own interests and habits. 
Therefore the diversity of recommendation lists plays a cru- 
cial role to quantify the personalization. The numerical results 
show that the recommendation lists generated by the BHC al- 
gorithm are of competitively higher diversity and remarkably 
higher accuracy than those generated by the standard HC al- 
gorithm. The statistical results on Facebook applications also 
show that the objects could be divided into two categories 
i3lll . One of them is collected by almost all of users, while 
others are only collected by small-size group users, which in- 
dicates that the users' tastes could be expressed by two cat- 
egories: popular one and special one. Therefore, the reason 
why BHC could produce higher accuracy is that users' two 
kinds of interests could be simultaneously identified. How- 
ever, how to timely track users' current popular and special 
tastes is still an open problem. 
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